LLMs on a Shoestring: The Dynamic Cache Advantage by Arvind Sundararajan
dev.to·9h·
Discuss: DEV
💾Cache Algorithms
Semantic Dictionary Encoding
falvotech.com·4h·
Discuss: Hacker News
🗂️Type Indexing
Rowhammer: TRR on DDR5 DRAM has been broken
comsec.ethz.ch·2h·
Discuss: Hacker News
🏷️Memory Tagging
Analog IMC Attention Mechanism For Fast And Energy-Efficient LLMs (FZJ, RWTH Aachen)
semiengineering.com·2h
🗺️Region Inference
What is Algebraic about Algebraic Effects?
interjectedfuture.com·3h
💫Effect Systems
Language Models Pack Billions of Concepts into 12,000 Dimensions
nickyoder.com·15h·
🌱Minimal ML
StringWa.rs on GPUs: Databases & Bioinformatics 🦠
ashvardanian.com·13m·
Discuss: r/programming
🚀Tokenizer Performance
A Slotted Hash Cons for Alpha Invariance
philipzucker.com·39m·
Discuss: Hacker News
🔗Lexical Scoping
LAVa: Layer-wise KV Cache Eviction with Dynamic Budget Allocation
arxiv.org·15h
🗺️Region Inference
A (Nearly) Branchless RESP Request Parser
kevinmontrose.com·7h
🔧Error Recovery
Conquering the LLM Memory Wall: How to Run 2–4x Longer Contexts with a Single Line of Code
reddit.com·7h·
Discuss: r/LocalLLaMA
🗺️Region Inference
The future of microoptimization
goldenstack.net·2d·
Discuss: Hacker News
🔬Nanopasses
Writing an operating system kernel from scratch
popovicu.com·2d·
🖥️Minimal VMs
Power Query Secret Tip to Lightning-Fast Approximate Matches
geeky-gadgets.com·6h
📊Query Optimizers
More hardware won’t fix bad engineering
infoworld.com·10h
🔮Branch Predictors
H100 PCIe – 1.86 TB/s memcpy roofline and 8× uplift
news.ycombinator.com·1d·
Discuss: Hacker News
🧠Memory Hierarchy
LLM Rerankers for RAG: A Practical Guide
fin.ai·21h·
🪜Recursive Descent
Crashes are loud. Leaks are quiet.
blog.bitdrift.io·19h
🧠Memory Ordering
Balance between refactoring and inheritance in your code
github.com·7h·
Discuss: Hacker News
🧪Compiler Testing